The importance of the normality assumption in large public health data sets.

نویسندگان

  • Thomas Lumley
  • Paula Diehr
  • Scott Emerson
  • Lu Chen
چکیده

It is widely but incorrectly believed that the t-test and linear regression are valid only for Normally distributed outcomes. The t-test and linear regression compare the mean of an outcome variable for different subjects. While these are valid even in very small samples if the outcome variable is Normally distributed, their major usefulness comes from the fact that in large samples they are valid for any distribution. We demonstrate this validity by simulation in extremely non-Normal data. We discuss situations in which in other methods such as the Wilcoxon rank sum test and ordinal logistic regression (proportional odds model) have been recommended, and conclude that the t-test and linear regression often provide a convenient and practical alternative. The major limitation on the t-test and linear regression for inference about associations is not a distributional one, but whether detecting and estimating a difference in the mean of the outcome answers the scientific question at hand.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of Ordinal Response Modeling Methods like Decision Trees, Ordinal Forest and L1 Penalized Continuation Ratio Regression in High Dimensional Data

Background: Response variables in most medical and health-related research have an ordinal nature. Conventional modeling methods assume predictor variables to be independent, and consider a large number of samples (n) compared to the number of covariates (p). Therefore, it is not possible to use conventional models for high dimensional genetic data in which p > n. The present study compared th...

متن کامل

Fixed point theorems for generalized quasi-contractions in cone $b$-metric spaces over Banach algebras without the assumption of normality with applications

In this paper, we introduce the concept of generalized quasi-contractions in the setting of cone $b$-metric spaces over Banach algebras. By omitting the  assumption of normality we establish common fixed point theorems for the generalized quasi-contractions  with the spectral radius $r(lambda)$ of the quasi-contractive constant vector $lambda$ satisfying $r(lambda)in [0,frac{1}{s})$  in the set...

متن کامل

A New Bootstrap Based Algorithm for Hotelling’s T2 Multivariate Control Chart

Normality is a common assumption for many quality control charts. One should expect misleading results once this assumption is violated. In order to avoid this pitfall, we need to evaluate this assumption prior to the use of control charts which require normality assumption. However, in certain cases either this assumption is overlooked or it is hard to check. Robust control charts and bootstra...

متن کامل

A New Nonparametric Regression for Longitudinal Data

In many area of medical research, a relation analysis between one response variable and some explanatory variables is desirable. Regression is the most common tool in this situation. If we have some assumptions for such normality for response variable, we could use it. In this paper we propose a nonparametric regression that does not have normality assumption for response variable and we focus ...

متن کامل

Not Up for Discussion: Applying Lukes’ Power Model to the Study of Health System Corruption; Comment on “We Need to Talk About Corruption in Health Systems”

This companion paper suggests the potential benefits of applying Steven Lukes’ dimensions of power model to the study of corruption in health systems. Lukes’ model sets out three “faces of power” classified by their influence on political discourse, resulting in overt, covert and latent discussion of issues depending on the degree of their alignment with the agenda of d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Annual review of public health

دوره 23  شماره 

صفحات  -

تاریخ انتشار 2002